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Abstract 

We propose a purely combinatorial algorithm for max fc -vertex COVER in bipar¬ 
tite graphs, achieving approximation ratio 0.7. The only combinatorial algorithms 
currently known until now for this problem are the natural greedy algorithm, that 
achieves ratio (®“i)/e = 0.632, and an easy 2 / 3 -approximation algorithm presented 

ini. 


1 Introduction 

In the MAX /c- VERTEX COVER problem, a graph G = {V,E) with \V\ = n and \E\ = m is 
given together with an integer k ^ n. The goal is to find a subset A C U with k elements 
such that the total number of edges covered by K is maximized. This problem is NP-hard 
even in bipartite graphs mM- 

The approximation of MAX /c- vertex cover has been originally studied in [6] , where 
an approximation 1 — i/e was proved, achieved by the natural greedy algorithm. This 
ratio is tight even in bipartite graphs [3] . In [T] , using a sophisticated linear programming 
method, the approximation ratio for MAX /c- vertex cover is improved up to 3/4. Finally, 
by an easy reduction from Mm Vertex Cover, it can be shown that MAX fe -vertex 
COVER in general graphs does not admit a polynomial time approximation schema (PTAS), 
unless P = NP [7j . 

Obviously, the result of [T] immediately applies to the case of bipartite graphs. Very 
recently, [5] has improved this ratio in bipartite graphs up to s/g, always using involved lin¬ 
ear programming techniques, but the existence of a PTAS for such graphs always remains 
open. 

Finally, let us note that MAX fc-VERTEX cover is polynomial in regular bipartite 
graphs or in semi-regular ones, where the vertices of each color class have the same degree. 
Indeed, in both cases it suffices to chose k vertices in the color class of maximum degree. 

Our principal motivation for this paper is to study in what extent combinatorial meth¬ 
ods for MAX fc-VERTEX COVER compete with linear programming ones. In other words, 
what is the ratios’ level, a purely combinatorial algorithm can guarantee? In this purpose, 
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we devise an algorithm that builds five distinct solutions and returns the best among 
them; for this algorithm, we prove a worst case 0.7-approximation ratio. Let us note that 
a similar issue is presented in [5] for MAX CUT where a 0.531-ratio combinatorial algorithm 
is given. Comparison of classes of methods with respect to their abilities to solve problems 
seems to be a very interesting research issue. This may bring new insights to both the 
problems handled and the methods themselves. Furthermore, such studies may exhibit 
interesting and funny mathematical problems. 

Note finally that in [3], an easy 2 / 3 -approximation algorithm is presented, together 
with a very complex one where a computer assisted analysis was giving a ratio of 0.792. 
But this ratio is impossible to be proved analytically. 

2 Preliminaries 

Consider a bipartite graph B{Vi,V 2 , E), fix an optimal solution O for MAX fe -vertex 
COVER (i.e., a vertex-set on k vertices covering a maximum number of edges in E) and 
denote by ki and k 2 the cardinalities of the subsets Oi and O 2 of O lying in the color- 
classes Vi and V 2 , respectively. W.l.o.g., we assume fci ^ ^2 and we set: 

ki = n- k2, ( 1 ) 

k = ki + k2 = {l + fi)-k2 (2) 

Denote by <5(17'), V' C V = Fi U V 2 , the number of edges covered by V and by opt(i?) 
the value of an optimal solution (i.e., the number of edges covered by O). 

Let S'*, i = 1, 2, be the ki vertices of F that cover the most of edges. Obviously, Si is 
the set of the ki largest degree vertices in F (breaking ties arbitrarily) and the following 
hold: 


5(5i) ^ 6{Oi) (3) 

<^(^ 2 ) ^ 5{02) 

In what follows, we call “best” vertices, a set of vertices that cover the most of uncovered 
edge^in B. Furthermore, we will also use the following additional notations and conven¬ 
tions (we assume that vertices in both Fi and F are ordered in decreasing degree order), 
where all the greek letters used imply parameters that are all smaller than, or equal to, 1: 

• (5(Oi): the number of edges covered by Oi; for conciseness we set (^(Oi) = a-opt(i?); 

• Pi ■ 6 (Oi) = Pi ■ a ■ opt{B): the number of edges covered by S*! n Oi; 

• d'{ 02 )'- the number of private edges covered by O 2 , i.e., the edges already covered 
by Oi are not counted up to 6'{02)] obviously, 6'{02) = (1 — a) • opt(7?); 

• 9 ■ 6{Oi): the number of edges (if any) from Oi that go “below” O 2 (recall Fi and F 2 
are ordered in decreasing degree order); 

• 7 • 5 ^( 02 ): symmetrically, it denotes the number of edges of O 2 that go below the 
vertices of Oi; 

• C'<^(Oi): suppose that after taking the k best vertices of Fi, there still remain, say, k'l 
vertices of Oi that have not been encountered yet; then, • (5(Oi) is the number of 
edges covered by those vertices; 

^For instance, saying “we take 5i plus the fe2 best vertices in V2, this means that we take Si and then ^2 
vertices of highest degree in B[{Vi \ Si), F2]. 
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• A • 6 '{ 02 )- this is the symmetric of the quantity C • <^( 01 ) for the pair (V 2 , 02 ) 
(supposing that the number of vertices in O 2 that have not been encountered is A:^)- 

In Figure [T| the edge-sets defined by the parameters above are iiiustrated. Heavy iines 
within rectangies Vi and V 2 represent the borders of and S 2 (the upper ones) and those 
of the k best vertices (the iower ones). Edges from Oi (arg((5(Oi))) are not shown in the 
the figure. They can go everywhere in V 2 . Private edges of O 2 (arg(( 5 '( 02 ))) are shown as 
heavy iack iines (the set of edges 62 )- They can go everywhere in Vi \ Oi. 




0 


Oi 


Vi 


F2 


02 



Figure 1: The edge-sets induced by the severai parameters. 


The basic idea of the aigorithm is quite simpie. It computes the best among five solu¬ 
tions built. It is presented in Section]^ while, in Section]^ we show that its approximation 
ratio is bounded below by 0.7. 

3 The algorithm ... 

The algorithm guesses the cardinalities ki, k 2 of Oi and O 2 , respectively, builds the five 
solutions specified just below and returns the best among them: 

SOLi: take 5i plus the k 2 remaining best vertices from V 2 ; 

SOL 2 : take S 2 plus the ki remaining best vertices from Hi; 

SOL 3 : take the k best vertices of V 2 ; 

SOL 4 : take the best between the following two solutions: 

1 . the k best vertices of Hi; 
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2 . the best 2 • ki vertices of Vi plus the remaining k — 2 ■ ki best vertices of V 2 . 

Let us note that the algorithm above, since it runs for any value of ki and k 2 , it will run 
for ki = k and k 2 = k. So, it is optimal for the instances of [3], where the greedy algorithm 
attains the ratio (e-i)/e. 

In what follows in this section, in Lemmata to we analyze the solutions built by 
the algorithm and provide several expressions for the ratios achieved by each of them. All 
these ratios are expressed as functions of the parameters specified in Section In order 
to simplify notations from now on we shall write opt instead of opt{B). 

Lemma 1. The approximation ratio achieved by solution SOLi is the maximum of the 
following quantities: 

1 — a + /3i ■ a 
a + 7 • (1 — a) 

Furthermore, if Si and Oi coincide (i.e., n Oi = Si), SOLi is optimal. 

Proof. For Q, Si covers, by more than (5(Oi) = a-opt edges. Decompose this edge- 
set into a set X of edges covered by Si \ {Si n Oi) and the set of edges of size fii ■ a ■ opt 
of edges covered by S'! n Oi. On the other hand, the /c 2 remaining best vertices in V 2 
will cover more edges than the ^2 remaining best vertices in O 2 ) that cover more than 
(1 — a) • opt — |X| edges, qed. 

For ([^, whenever Si does not coincide with Oi, there are vertices of Oi that ly 
below Si. Since 'y ■ 5'{O 2 ) is the number of edges from O 2 that go belong Oi, these edges 
will be not counted up in the set of edges covered by ^i. 

Finally, if 5i and Oi coincide, SOLi will cover a ■ opt + (1 — a) • opt = opt edges. | 

Lemma 2. The approximation ratio achieved by solution SOL 2 is bounded below by: 

1 — a + a ■ 6 ( 6 ) 

Proof. The proofs is similar with the one of Lemma 0 for 0. I 

Lemma 3. The approximation ratio achieved by solution SOL 3 is the maximum of the 
following quantities: 

l-X-{l-a)-a-e (7) 

(1 - a) • (I + A • /r) ( 8 ) 

Proof. If after taking the k best vertices of V 2 the whole of O 2 has been encountered, all 
but 9 -( 1 ( 01 ) edges of the optimum have been covered. In this case, an appoximation ratio 
I — a • 0 is achieved. 

Otherwise, by the definition of A • < 5 ^( 02 ): 

opt — X ■ 6' (O 2 ) — 6 ■ 6 (Oi) = opt ■ {1 — X ■ {1 — a) — 9 ■ a) 
edges of the optimum are covered. 

On the other hand, taking the k best vertices of V 2 ) consists of first taking S 2 (covering 
{1 — a) ■ opt edges) and then the ki best vertices below it. Furthermore, below the k best 
vertices, the group of the /c 2 “worst” vertices of O 2 has average degree at least 
Since the algorithm takes ki “better” vertices, they will cover at least: 

“4 • A • (5' (O 2 ) ^ 7 ^ • A • (1 — a) • opt ^ /r • A • (1 — a) • opt 

which proves Q. | 


(4) 

(5) 
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Lemma 4 . The approximation ratio achieved by solution SOL4 is the maximum of the 
following quantities: 


1 — C • a — 7 • (1 — a) (9) 

( 2 -/ 3 ,).a+ (10) 

jl 

(1 - /x) + a • (2 + /i) - g • / 3 i {I - 2 ■ p,) ■ ( ■ a 

2 2 ■ p 

Proof. Let us first note that, if after taking the k best vertices in Vi all the vertices of Oi 
are captured, the approximation ratio achieved is 1 — 7 • (1 — a) since only 7 • 6'{02) = 
7 • (1 — a) • opt edges of the optimum are not covered. Suppose now that k'l verices of Oi 
are not captured. In this case, the k vertices taken from Vi cover: 


opt — ( ■ 5 (Oi) — 7 • (O2) = (1 — C'O — 7'(1 — Oi)) ■ opt 

For ( fTo] ) and © now, observe hrst that the k vertices taken from Vi can be seen as 

the union of ^/fci consecutive /ci-groups (called clusters in what follows) and that, by 0 

and 0, fc/fci = 1+M//J. Assume also that the k — k'l of Oi encountered among the k best 

vertices of Vi are included in the vr first clusters. Denote by Ki the number of vertices of Oi 
in the i-th cluster, i = 1 ,... , 7 r, and suppose that the “optimal” Ki vertices of cluster i 
cover Pi ■ (i(Oi) = Pi ■ a ■ opt edges. 

Claim 1. Consider cluster i and denote by Oi^i the part of Oi not captured by clusters 
1 , 2 , ...,z — 1 (so, Oi^i = Yl]=i^j + ^i)- Then, the vertices of cluster i will cover at 
least (1 — Pj) ' Oi • opt edges. 

In order to prove Claim observe that the part of ( 5 (Oi) covered by Oi^i is: 

i=i V i=i 


i—1 

6{Opi)=6iOi)-T 


^ •a • opt 


and that the ( 5 (Oi^i) edges are covered by Yl]=i = ki — (X]j=i ^j) ^ vertices, 

while cluster i contains exactly ki vertices with degree at least as large as those of Oi,i. 
An easy average argument derives then that the vertices of cluster i will cover at least: 


(1 - EJ.l ft) ■ a ■ opt 



• a ■ opt 


edges, qed. 

Consider the two first groups clusters taken from Vi. The hrst of them ( 5 i) covers 
more than 5 (Oi) = a • opt edges (by 0 ) while, by Claim the second one will cover 
more than (^i/fci-Ki) • (1 — Pi) ■ 6{Oi) ^ (1 — Pi) ■ a ■ opt edges. Observe also that, by Q 
and In any of the remaining — 2 = clusters, their vertices 

obviously cover more than C • 5 (Oi) = C, - a - opt edges (indeed, by the average argument 
of Claimmore than ki ■ C-< 5 (Oi)/fc' ^ Q ■ ( 5 (Oi))). We so have: 


ISOL4I ^ 


(2 — Pi) + 


1 - h 
h 



HOi) 


(2-Pi) + ^-C 


■ a • opt 


that proves (10). 
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Let us now get some more insight in the value of SOL 4 . By extending the discussion 
just above, the ki vertices of cluster i will cover more than: 


ki 


i—1 


2 — 1 
ki- Y. 

j=l 


i=i 


( 12 ) 


Furthermore, as seen previously, all clusters below the vr first ones containing the ki — k[ 
captured vertices of Oi, will cover more than • (5(Oi) each. 

Hence, summing (12) for i = 1 to vr, taking into account the remark just above, and 

1 + /r 


setting /3o = 0 , the following holds: 

^TT — 1 


ISOL 41 ^ IE + 

i =0 \ j=0 


TT—l 


= TT - 


Pi + 


2=1 


1 + /i 


-ttJ -Cj 

-ttVc) -MOi) 


TT—l TT—l 


= TT — TT 


• Pi + i ■ Pi + 


i=l i=l 


1 + /i 


-TT -C -MOl) (13) 


Observe now that: 


TT—l 


vr-J^A = 7r.(l-/3,-C)-5 (Oi) 


(14) 


2 = 1 


and combine (14) with (13). Then, the latter becomes: 


ISOL 4 I ^ |^7r-7r-(l-/3^-C) + E*-A+(^-^) •'^(Oi) 


V 2=1 


El * ■ A + 


^ 2=1 


1 + /r 


c -<5(01) 


^A + E/^* + E(*- 2 )-A + —-c) --5(01) 

^^^=1 i=2 2=3 ^ / 

(1 - C) + (1 - /5i - C) + E(* - 2) • A + ^ (Oi) 
(2 - /3i) + ^(i - 2) • /3i + • cV 5 (Oi) 


i=3 


^ — Pi) + E 'Pi + 


i=3 


1 -/^ 




C -5(01) 


(15) 


Set Yi= 3 'Pi ■ '^( 1 ^ 1 ) = These edges are covered by both Oi and SOL 4 . Then, (15) 
becomes: 


1 -/i 




ISOL 4 I ^ {2-Pi) + -^-C •5(Oi) + X= (2-/3i) +-.a-opt + X (16) 


1 -// 


k 
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On the other hand, consider Item in SOL 4 . The 2 • ki best vertices from Vi cover 
(1 — C) • ~ ^ edges of the optimum. Let T + [(1 — (") • < 5 ( 01 ) ~ X] be the total 

number of edges covered by those vertices. Then, best k — 2 ■ ki vertices of V 2 will cover 
at least as many edges as the k — 2 ■ ki best vertices of O 2 , that will cover at least 
{{k-'i-ki)jk 2 ) • 5'{02) — Y. Putting all this together, we get: 

ISOL 4 I ^ y + [(1 - 06 { 0 i) + ■ 5 ' (O 2 ) - 

k2 

= ^(1 - C) • a + -— ^ ■ (1 - a)^ • opt - X 

0J1 ((1 _ . Q, _|_ (1 _ . (1 _ o;)) . opt - X (17) 


Expression (16) is increasing with X, while © is decreasing. Equality of them, leads 
after some easy algebra to: 


X = 


{I - n) - a ■ {2 - fi) + f3i ■ a C ' « 
2 2 ^ 


opt 


(18) 


Embedding (18) to (16) and dividing the ratio obtained by opt, derives the ratio claimed 
by (0.1 


4 ... and its approximation ratio 

The objective of this section is to prove the following theorem. 

Theorem 1. MAX A:-vertex cover is comhinatorially approximable within ratio 0.7. 

Proof. Eor the proof we propose an exhaustive parameter-elimination method (very prob¬ 
ably non-optimal) that has the advantage to be quite simple. It consists of subsequently 
eliminating parameters from the ratios proved in Lemmata to until two ratios that are 
only functions of /r are got. These ratios have opposite monotonies with respect to this 
parameter, hence by equalizing them we determine a lower bound for the overall ratio of 
the algorithm. 

Elimination of 9: ratios ([^ and ([^ 

Equalizing ratios given by ([^ and ([^ leads to 2a ■ 9 = a — X ■ {1 — a) ^ a ■ 9 
and embedding it in (j^ derives: 

2 - a • (1 - A) - A 
2 


= a-X-(l-a/2 


(19) 


Elimination of A: ratios 

Equalizing ratios given by (| 
derives: 


]) and ( [T^ 

and ( [T^ gives A = “/(i-Q;)-(i+ 2 -/i). This, together with 
1 -l- /i 


1 — a 


1 -h 2 ■ fJ. 


( 20 ) 


Elimination of 7 : ratios ([^ and ([^ 

It gives 7 = and the ratio obtained is: 

1 + a — ( ■ a 
2 


( 21 ) 
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Elimination of ratios ( [Tol ) and Q 

We have: 

{1 — ^j) ■ Q ■ a I + a — C ■ a 


(2 — /3i) • a + 

2 — fj, 


^ C = 


2 

('•0 = 1 + 0 — 2 -(2 — / 3 i) - 0 = 1 — 3 -o + 2 -/?i-o 




1 — 3 -o + 2 -/ 3 i-o 


2 — fj, 


a 


( 22 ) 


and embedding ( 22 ) in ( 10 ), we get: 


(2-/3i)-o + ^-(l-o-(3-2-/3i)) 

(2 - //) • (2 - /3i) • o + (1 - /x) - o • (1 - /x) • (3 - 2 • /3i) 

2 — /X 

(1 - /x) + o • (1 + /X - /X • /3i) 


( 23 ) 


2 — ^ 

First elimination of /3i: ratios (|^ and (23) 

We have: 

(l-/x) + o-(l + /x-^-/3i) 

1 - O + /3i • O = --- 

2 — fi 

=> 2 • /?! • o = (1 — /x) + o • (1 + /x) — (2 — ^) + o • (2 — /x) = —1 + 3 • o 

^3-0 — 1 , ,, 

^ Pi = -1, - 24 

2 • o 


Now, combination of (|^ and (24) derives: 


1 — o + / 3 i-o = l — o + 


3-0 — 1 1 + 0 


(25) 


First ratio function of /x: combination of ratios (20) and (25) 


Ratio (20) is decreasing with o, while ratio (25) is increasing. Combination of them allows 
elimination o in order to get a first ratio that is only a function of /x. Equalizing (20) 

1 + /X 


and (25) gives: 


1-0 


1 + 0 


=► o = 


1 + 2 • /X 

2 • /X + 1 
4 • /X + 3 


and embedding (26) in ( 20 ) derives: 

1 + /i 


1-0 


1 + 2 • ^ 


o - ( 4 -^ + 3 ) = 2 - /x + l 


= 1 - 


2 • /X + 1 1 + /(X 


4 -/X + 3 1 + 2 - ^ 

2 + 3 -/X 

3 + 4 • /X 


(26) 


(27) 
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Second elimination of /3i: ratios Q and ( |11D 

Revisit ratio 0 and observe that its last term is negative when fj, ^ 1/2. 

On the other hand, ratio (27) is increasing with ^ and bounded below by 0.7 as long as 
/i ^ 1/2. We so seek an “interesting” ratio when fj, ^ 1/2 and, in this case (i-2-M)-C-a/2-/i ^ 0 
and can be omitted. 

Hence, combination of ratios (Q and 0, for jj, ^ 1 / 2 , leads to: 


{1 - fj.) + a ■ {2 + fi) - a ■ (3i 


= \ — a + j3i ■ a 


(1 — /r) + a • (2 + /r) — a • /?! = 2 — 2 • a + 2 • /?! • a 

q: • (4 + /i) — (1 + /i) 


=> 3 • /3i • q; — ~(1 + /r) + o • (4 + /i) /3i — 

Then, combining (Q and (28), derives this time: 


3 • a 


1 — a + j3i ■ a = 


3 — 3*q:To*(4-)- /x) — (f /^) ^ — /x T o • (IT /x) 


(28) 


(29) 


Second ratio function of combination of ratios (29) and (20) 


Equalizing (20) and (29) gives: 


Once again, ratio (20) is decreasing with a, while ratio (29) is increasing. Combination 
of them allows elimination a in order to get a second ratio exclusively function of 

(1 T /x) • (2 T /x) 1 T /X 

3 

(30) 


1 — a • 


iT/x 2 — /xTq:'(1T /x) 


a = 


1 T 2 • /X 
1 T 2 • /X 


2 • a • 


3- (1T2-/X) 


2 - ( 2 T^) 


and embedding (30) in (20) derives: 

1 — a 


1 T /X 


3 T /X 


1T2-/X 4t2-^ 


(31) 


Final ratio 


As noted above, ratio (27) increases with /x, while (31) decreases. The value of /x guar¬ 
anteeing equality of these ratios also gives a lower bound for them. This value is /x = 1/2 
and, with this value, both ratios become 0.7. I 
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